Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add security foundations section #487

Open
wants to merge 22 commits into
base: main
Choose a base branch
from
Open

Conversation

coroiu
Copy link
Contributor

@coroiu coroiu commented Nov 25, 2024

🎟️ Tracking

📔 Objective

This ports over an internal document while making some adjustments to the principles. Requirements will initially be added as-is

⏰ Reminders before review

  • Contributor guidelines followed
  • All formatters and local linters executed and passed
  • Written new unit and / or integration tests where applicable
  • Protected functional changes with optionality (feature flags)
  • Used internationalization (i18n) for all UI strings
  • CI builds passed
  • Communicated to DevOps any deployment requirements
  • Updated any necessary documentation (Confluence, contributing docs) or informed the documentation
    team

🦮 Reviewer guidelines

  • 👍 (:+1:) or similar for great changes
  • 📝 (:memo:) or ℹ️ (:information_source:) for notes or general info
  • ❓ (:question:) for questions
  • 🤔 (:thinking:) or 💭 (:thought_balloon:) for more open inquiry that's not quite a confirmed
    issue and could potentially benefit from discussion
  • 🎨 (:art:) for suggestions / improvements
  • ❌ (:x:) or ⚠️ (:warning:) for more significant problems or concerns needing attention
  • 🌱 (:seedling:) or ♻️ (:recycle:) for future improvements or indications of technical debt
  • ⛏ (:pick:) for minor or nitpick changes

Copy link

github-actions bot commented Nov 25, 2024

Logo
Checkmarx One – Scan Summary & Details4d5c2146-db34-4f35-b103-3a29f766c67e

No New Or Fixed Issues Found

Copy link

cloudflare-workers-and-pages bot commented Nov 26, 2024

Deploying contributing-docs with  Cloudflare Pages  Cloudflare Pages

Latest commit: 528330f
Status: ✅  Deploy successful!
Preview URL: https://8a130d8c.contributing-docs.pages.dev
Branch Preview URL: https://ps-add-fundamental-reqs.contributing-docs.pages.dev

View logs

@coroiu coroiu requested a review from MGibson1 November 26, 2024 10:35
@coroiu coroiu marked this pull request as ready for review November 26, 2024 10:35
@coroiu coroiu requested a review from a team as a code owner November 26, 2024 10:35
Comment on lines +7 to +11
The requirements in this section are organized hierarchically, with top-level requirements defining
the core rules and obligations that must be met, serving as broad objectives for Bitwarden's
security model. Sub-requirements expand on these by addressing specific scenarios, exceptions, or
clarifications, and may override their parent requirement when explicitly stated. Sibling
requirements at the same level must remain independent and free of contradictions.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thought: Might be nice to align this to an industry standard like ISO/IEC/IEEE 29148, but there's only so much we can do in a markdown document

2. Vault data **MAY** be unprotected while _in use_.

- a. The Client **MAY** decrypt all vault data during vault unlock.
- i. The Client **SHOULD** minimize the quantity of decrypted vault data.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thought: we could make this more stringent

Suggested change
- i. The Client **SHOULD** minimize the quantity of decrypted vault data.
- i. The Client **SHOULD** limit decrypted data to the records actively in use by the user.

@@ -0,0 +1,21 @@
# P03 - No security on fully compromised systems
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

point-of-interest: this was previously called No guarantees for unlocked vaults on fully compromised devices. I changed it to make it more consistent with the other principles, but "no security" might seems a bit harsh

Copy link
Contributor

@quexten quexten Dec 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Related to the confluence point; I think we might need to add "no security guarantees on fully compromised bitwarden contexts"

@withinfocus
Copy link
Contributor

Thoughts on dropping the acronyms and typed declarations? "P01" and "EK" come off rather strongly to me. Same for the "MUST" and "MUST NOT" all over the place that could probably just be expressed without capitalization and boldface.

@coroiu
Copy link
Contributor Author

coroiu commented Nov 27, 2024

@withinfocus for P01 I thought about doing something similar to ADRs. I also wanted to be able to connect requirements to principles somehow later down the line in which I think the numbering could be useful for.

The boldface might be distracting you're probably right, but I'm not sure about removing the capitalization. Here I'm drawing inspiration from W3C specifications such as WebAuthn L3 where those terms are always captialized

@withinfocus
Copy link
Contributor

I get where you're coming from with the historical references, but I see those as specifications with a large scope and impact. This is important to the company as principles, but I wouldn't treat it quite so intensely such as to model it vs. writing natural language.

@quexten
Copy link
Contributor

quexten commented Nov 28, 2024

Imo:

I prefer the RFC-like MUST NOT, and SHOULD NOT, etc. because it makes it clear that we are unambiguous and very deliberate about whatever meaning we encode with them, because they have a clear definition (from rfc2119).
Natural text, even if it's the same words, does not do that. In natural text, "must not" does not necessarily mean an absolute prohibition, and could just be a strong recommendation.

So, if we want to treat this as a set of requirements, that we want to design our applications by, then the RFC-style terms helps make these requirements unambiguous and clear.
(If we are just stating principles as recommendations/guidelines, without a strict requirement, then it does not matter.)

@coroiu
Copy link
Contributor Author

coroiu commented Nov 28, 2024

My vision was that these requirements would be what we base all of our security design on, and not be "mere guidelines". If that will happen or not is probably beyond the scope of this PR. :) That said, the primary reason that motivated me to create this documentation was to remove ambiguousness surrounding security design and make it very clear for teams that have to design solutions for our products. Given that I am hesitant to make any changes that could potentially decrease clarity and introduce ambiguousness

Copy link
Member

@eliykat eliykat left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work! A few comments from an AC perspective.

Comment on lines 23 to 24
<dt>User</dt>
<dd>The owner of the vault data.</dd>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Does a User include an organization?

  2. Who is the owner of vault data? For example - organizations often consider themselves the "owner" of data stored in an individual user's vault if it's a business account. But we generally mean "the User or Organization linked to the cipher in the database". This will increase as we give organizations more control over individual accounts (e.g. password reset, delete).

Copy link
Contributor Author

@coroiu coroiu Dec 10, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tweaked this a little bit to avoid the question of ownership

docs/security/definitions.mdx Show resolved Hide resolved
docs/security/principles/01-locked-vault-is-secure.mdx Outdated Show resolved Hide resolved
docs/security/requirements.mdx Show resolved Hide resolved

- a. The Client **MUST** encrypt vault data stored on disk.
- b. The Client **MUST** use the UserKey to encrypt vault data stored on the Server.
- c. The Client **SHOULD** ensure that no mechanisms exists such that vault data may be stored to
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

typo:

Suggested change
- c. The Client **SHOULD** ensure that no mechanisms exists such that vault data may be stored to
- c. The Client **SHOULD** ensure that no mechanisms exist such that vault data may be stored to

- a. The Client **MAY** decrypt all vault data during vault unlock.
- i. The Client **SHOULD** minimize the quantity of decrypted vault data.
- b. The Client **MAY** leave vault data encrypted after unlock.
- c. The Client **MUST** ensure that unprotected data does not linger when no longer _in use_.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What do we mean by linger? This is all very precise but "linger" is quite vague.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmmm, I think is not present in memory is what I'm looking for here, but that makes this requirement hard to implement. Maybe we should change it to a SHOULD at the same time

Suggested change
- c. The Client **MUST** ensure that unprotected data does not linger when no longer _in use_.
- c. The Client **SHOULD** ensure that unprotected data is not present in memory when no longer _in use_.

I still think this requirement will have a pretty large impact if adopted since it tends to discourage our huge caches, but having no caches is the current approach in the SDK so hopefully we won't have to prioritize any additional work just for this

Copy link
Member

@eliykat eliykat Dec 4, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree with this; being more specific with the requirement while loosening the language a little (MUST -> SHOULD) seems like a good balance.

2. The UserKey **MUST** be protected _at rest_.
3. The UserKey **MAY** be unprotected while _in use_.
4. The UserKey **MUST** be protected while _in transit_.
5. The UserKey **MUST NOT** be shared.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What about account recovery and emergency access?

Suggested change
5. The UserKey **MUST NOT** be shared.
5. The UserKey **MUST NOT** be shared without _informed and explicit consent_ by the User.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're actually the second person to give me this exact comment so there is definitely room for improvement here. Account recovery and emergency access is not "sharing" according to the definition:

Data sharing
The controlled process in which data leaves the Bitwarden secure environment unprotected. As a consequence the guarantees made by this document will no longer apply. The receiving party may or may not have its own guarantees.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Interesting. Given that "sharing" has a common use, maybe there is a more specific term that can be used here? (Maybe "export" - or would that be confused with a vault export?)

Alternatively, it's only used a couple of times - maybe you remove the definition and be explicit each time: "The UserKey MUST NOT leave the Bitwarden secure environment in an unprotected state."


As a separate question - does this document have anything to say about when data is exchanged/accessed between users then? EA and Account Recovery are within the Bitwarden secure environment, but it has security implications.

Copy link
Contributor Author

@coroiu coroiu Dec 4, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Both very good suggestions. I think we might need the definition just to make the document easier to read, but you are right that it might be mixed up with vault export. Still think its worth it though, I'll update it when I get a chance!

does this document have anything to say about when data is exchanged/accessed between users then? EA and Account Recovery are within the Bitwarden secure environment, but it has security implications.

Not yet. I think we should redefine this to be "Sharing" and then start writing some principles and requirements on it

Comment on lines 3 to 4
Clients must ensure that once the vault has been locked, no vault data can be accessed in plain
text, even if the device becomes compromised after the lock occurs. Protections are not guaranteed
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What about vault data that is intentionally stored in plain text? e.g. modification dates, folder guids, boolean values. Or is this excluded by the definition of "vault data"? (is metadata "sensitive and private information"?)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, there is actually still an open thread in confluence about this because I was hoping to get Vault involved in classifying data a bit better. My idea is to split up the vault data into groups of varying sensitivity. In that case we might get:

  • High: Password
  • Moderate: Username
  • Low: Creation date

And then say something like "low sensitivity data" may be stored unprotected

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think that would work!

Copy link
Contributor Author

@coroiu coroiu Dec 10, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tweak this a little bit by only grouping the data into highly sensitive and less sensitive (see 528330f) to avoid getting into hard discussions. What do you think?


:::

1. The authentication tokens MUST be protected at rest if the client provides a mechanism for secure
Copy link
Member

@trmartin4 trmartin4 Dec 6, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
1. The authentication tokens MUST be protected at rest if the client provides a mechanism for secure
1. The authentication tokens **MUST** be protected at rest if the client provides a mechanism for secure

Suggestion to maintain consistency of font weight.


1. The authentication tokens MUST be protected at rest if the client provides a mechanism for secure
storage.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
2. The authentication tokens **SHOULD** be protected in transit.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants